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Abstract 

We present a nonlinear stochastic differential equation (SDE) which mimics the 
probability density function (PDF) of the return and the power spectrum of the ab- 
solute return in financial markets. Absolute return as a measure of market volatility 
is considered in the proposed model as a long-range memory stochastic variable. 
The SDE is obtained from the analogy with earlier proposed model of trading ac- 
tivity in the financial markets and generalized within the nonextensive statistical 
mechanics framework. The proposed stochastic model generates time series of the 
return with two power law statistics, i.e., the PDF and the power spectral density, 
reproducing the empirical data for the one minute trading return in the NYSE. 
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1 Introduction 



High frequency time series of financial data exhibit sophisticated statisti- 
cal properties. What is the most striking is that many of these anomalous 
properties appear to be universal. Vast amounts of historical stock price data 
around the world have helped to establish a variety of so-called stylized facts 
P12][5]IHI5|5] . which can be seen as statistical signatures of financial processes. 
The findings as regards the PDF of the return and other financial variables are 
successfully generalized within a non-extensive statistical framework The 
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return has a distribution that is very well fitted by g-Gaussians, only slowly 
becoming Gaussian as the time scale approaches months, years and longer 
time horizons. Another interesting statistic which can be modeled within the 
nonextensive framework, is the distribution of volumes, defined as the number 
of shares traded. 

Interesting stochastic models related to the nonextensive statistics include an 
ARCH process with random noise distributed according to a g-Gaussian as well 
as some state-dependent additive- multiplicative processes [8j. These models do 
capture the distribution of returns, but not necessarily the empirical temporal 
dynamics and correlations. Additive-multiplicative stochastic models of the 
financial mean-reverting processes provide a rich spectrum of shapes for the 
probability distribution function (PDF) depending on the model parameters 
[Ij. Such stochastic processes model the empirical PDF's of volatility, volume 
and price returns with success when the appropriate fitting parameters are 
selected. Many other fits are also proposed, including exponential ones [10] 
applicable for larger time scales. 

Nevertheless, there is a necessity to select the most appropriate stochastic 
models, able to describe volatility as well as other variables in dynamical 
aspects and long-range correlation aspects. 

There is empirical evidence that trading activity, trading volume, and volatility 
are stochastic variables with the long-range correlation [TT]|12fl3] and this key 
aspect is not accounted for in some widely used models. The ARCH-like, 
multiscale models of volatility, which assume that the volatility is governed by 
the observed past price changes over different time scales, have been recently 
proposed p3fT5] . Trading volume and trading activity are positively correlated 
with market volatility. Moreover, trading volume and volatility show the same 
type of long memory behavior [12] . 

Recently we investigated analytically and numerically the properties of stochas- 
tic multiplicative point processes [T7fT8] . derived a formula for the power 
spectrum and related the model with the general form of the multiplicative 
stochastic differential equation [T^PU] . The extensive empirical analysis of the 
financial market data, supporting the idea that the long-range volatility cor- 
relations arise from trading activity, provides valuable background for further 
development of the long-ranged memory stochastic models [T2|l3j . The power 
law behavior of the autoregressive conditional duration process [21] based on 
the random multiplicative process and its special case the self-modulation pro- 
cess [22], exhibiting 1// fluctuations, supported the idea of stochastic modeling 
with a power law PDF and long memory. A stochastic model of trading activ- 
ity based on an SDK driven Poisson-like process has been already presented 
in [23j. We further develop an approach of modulating the SDK with a closer 
connection to the nonextensive statistics in order to model the dynamics of 
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return in this paper. 



Long memory (long-term dependence) has been defined in time domain in 
terms of autocorrelation power law decay, or in frequency domains in terms of 
power law growth of low frequency spectra. Despite statistical methodology 
being developed for the data with the long-range dependence and the solid 
mathematical foundations of the area ^5], let us consider behavior of the 
financial variables only in the frequency domain, analyzing the power spectral 
density. 

In the second Section we present the nonlinear SDE generating a signal with 
a g-Gaussian PDF and power law spectral density. In the third Section we 
analyze the tick by tick empirical data for trades on the NYSE for 24 shares 
and adjust the parameters of the proposed equations to the empirical data. A 
short discussion and conclusions are presented in the final section. 



2 The stochastic model with a g-Gaussian PDF and long memory 

Earlier we investigated stochastic processes with long-range memory proper- 
ties. Starting from the stochastic point process model, which reproduced a vari- 
ety of self-affine time series exhibiting the power spectral density S{f) ~ 1//^ 
scaling as power /5 of the frequency / [[ISj, later we introduced a Poisson-like 
process driven by the stochastic differential equation. The latter served as an 
appropriate model of trading activity in the financial markets [23J. In this 
section we generalize an earlier proposed nonlinear SDE within the nonex- 
tensive statistical mechanics framework to reproduce the long-range memory 
statistics with a g-Gaussian PDF. The g-Gaussian PDF of stochastic variable 
r with variance cr^ can be written as 



where Aq is a constant of normalization and g defines the power law part of 
the distribution. P(r) is introduced through the variational principle applied 
to the generalized entropy |8j 




(1) 



Sq — k 



Here the g-exponential of variable x is defined as 



expq{x) = (1 + (1 - q)x) 



l-q 



(2) 
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and we assume that the g-mean /ig = 0. With some transformation of param- 
eters (Tg and q 



2 [3^ 



q — I \ q — 1 



we can rewrite the g-Gaussian in a more transparent form: 

A 



P .(r) - (^] ' (3) 



Looking for the appropriate form of the SDE we start from the general case 
of a multiphcative equation in the Ito convention with Wiener process W: 

dr = a{r)dt + b{r)dW. (4) 



If the stationary distribution of SDE (jlj) is the g-Gaussian ([3]), then the coef- 
ficients of SDE are related as follows |23]: 

«W = -^^Kr)^ + Kr)^. (5) 



From our previous experience modeling one-over-f noise and trading activity in 
financial markets pTfTS] . building nonlinear stochastic differential equations 
exhibiting power law statistics p^][20] . we know that processes with power 
spectrum S{f) ~ 1//^ can be obtained using the multiplicative term b{r) ~ 
or even a slightly modified form (rg -|-r^) 2 . Therefore, we choose the term b{r) 
as 

6(r) =cr(r2 + r2)i (6) 



and, consequently, by Eq. ([5]) we have the related relaxation 

a(r) = a'(^v- ^) {r^ + r^-'r. (7) 

Then one gets the stochastic differential equation 

dr = a^(r]- ^ J (r^ + r^'^rdt + (T(r^ + r'')UW. (8) 
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Note that in the simple case = 1 Eq. ([8]) coincides with the model presented 
in the article by Queiros et al. [26] with 



h{r) 



\ P{r)i 



1 ' «(o = -4 f ^ - iV (9) 



We will investigate higher values of rj in order to cache long-range memory 
properties of the absolute return in the financial markets. We can scale our 
variables 

x = -, ts = a^rl^''~^h (10) 



to reduce the number of parameters and to get simplified equations. Then 
SDE 

^{l + x^Y-^x<lt, + {l + x^)'idWs (11) 




describes a stochastic process with a stationary g-Gaussian distribution 

P^i^)-^^^m^.(r^Y (12) 



0Fr(A/2- 1/2) \l + x 
and the power spectral density of the signal S{f) 



+ ^ (13) 

(A-l)r(/?-l/2) f 2 + \-2r, V-' 
20F(r7 - 1) sin(7r/5/2) \ 2ti ) ^ ' 

with 0.5 < /3 < 2, 4 - 7] < A < 1 + 27] and ?7 > 1. Eqs. ffTlfTD were first 
derived for the multiplicative point process in [T7|IT8] and generalized for the 
nonlinear SDE ([8]) in [T91I20] . Although Eq. ([8]) coincides with Eq. (15) in ref. 
[20] only for high values of the variable r >> ro, these values are responsible 
for the power spectrum. Note that the frequency / in equation f[T^ is the 
scaled frequency matching the scaled time tg ffTUl) . The scaled equations ffTU]) - 
(fT^ define a stochastic model with two parameters A and rj responsible for 
the power law behavior of the signal PDF and power spectrum. Numerical 
calculations with Eq. (fTTj) confirm analytical formulas (11211141) (see ref. |20j). 

We will need a more sophisticated version of the SDE to reproduce a stochas- 
tic process with a fractured power spectrum of the absolute return observable 
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in financial markets. Having in mind the statistics of tlie stocliastic model 
(11 II) defined by Eqs. (fT2|) - (fT^ and numerical modeling with more sophisti- 
cated versions of the SDE, we propose an equation combining two powers of 
multiplicativity 

dx=(v--- {xe'^A xdts + — ^i±^l^diy,. (15) 



Here e divides area of x diffusion into two different power law regions to ensure 
the spectral density of |x| with two power law exponents. A similar procedure 
has been introduced in the model of trading activity [23j. The proposed new 
form of the continuous stochastic differential equation enables us to reproduce 
the main statistical properties of the return observed in the financial markets. 
This provides an approach to the market with behavior dependent on the 
level of activity and exhibiting two stages: calm and excited. Equation ( |T5i) 
models the stochastic return x with two power law statistics, i.e., the PDF 
and power spectral density, reproducing the empirical power law exponents of 
the trading return in the financial markets. At the same time, via the term 
{xe'^y we introduce the exponential diffusion restriction for the high values of 
X as the markets in the excited stage operate on the limit of nonstationarity. 
We solve Eq. (IT^ numerically using the method of discretization. Introducing 
the variable step of integration 



the differential equation f|T5l) transforms to the difference equation 



Xk+i=Xk + It]-^ - {xe'')A Xk + K,{xl + l)2ek 



■ tk + K- 



{Xl + 1)'^'! 



(16) 
(17) 



The continuous stochastic variable x does not include any time scale as the 
return defined in a time window r should. Having in mind that the return is an 
additive variable and depends on the number of transactions in a similar way 
to trading activity, we define the scaled return X in the time period r as the 
integral of the continuous stochastic variable X = ^l^"^ x{ts) /t dts- Note that 
T here is measured in scaled time units Eq. (fTOl) and will coincide with a one 
minute interval of empirical data. This serves as an procedure of adjustment 
to the real time scale for scaled equations. 
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Fig. 1. (a) The numerically calculated PDF of |X| = | //^^ x[t) jr dt^l from Eq. p7|) 
(black thin line), in comparison with the theoretical distribution 2P{x) Eq. (jl2p 
(gray thick line), and (b) the numerically calculated power spectrum of \X\. Param- 
eters r] = 5/2, A = 3.6, r = 0.0001 and e = 0.01 are selected to reproduce statistics 
for the absolute return in financial markets. 

It is worth recalling that integration of the signal in the time interval r does 
not change the behavior of the power spectrum for the frequencies f « K 
This is just the case we are interested in for the long-range memory analysis of 
financial variables and we can expect Eqs. (1131114^ to work for the stochastic 
variable X as well. We analyzed the influence of signal integration on the 
PDF in previous modeling of trading activity; see ref. |l7j. Integration of the 
nonlinear stochastic signal increases the exponent of the power law tails in 
the area of the highest values of the integrated signal. This hides fractured 
behavior of the X PDF, which arises for x as a consequence of the two powers 
in the multiplicative term of Eq. ( fT5l) . 

In Fig. 1 we demonstrate (a) the numerically calculated PDF of |X| in compar- 
ison with the theoretical distribution 2P{x) Eq. ( |T2l) and (b) the numerically 
calculated power spectrum of |X| with parameters appropriate for reproducing 
statistics for the absolute return in financial markets. 



3 Empirical analysis and model adjustment 

In this section we analyze the tick by tick trades of 24 stocks, ABT, ADM, 
BMY, C, CVX, DOW, FNM, GE, GM, HD, IBM, JNJ, JPM, KO, LLY, MMM, 
MO, MOT, MRK, SLE, PFE, T, WMT, XOM, traded on the NYSE for 27 
months from January, 2005, recorded in the Trades and Quotes database. 
We sum empirical tick by tick returns into one-minute returns to adjust the 
continuous stochastic model presented. There is a problem in the use of a 
straightforward procedure to determine t] from empirical data. One expects 
to have 77 ~ 1 when the return is assumed as a simple stochastic variable |26j . 
From our point of view the straightforward SDE recovery procedures do not 
work, as the return in real financial markets is at least double the stochas- 
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Fig. 2. An example of a moving average for 60 min of empirical absolute returns per 
minute (gray thick line) in comparison with the corresponding moving average of 
trading activity, number of trades per minute (black thin line). Scales are adjusted. 

tic process influenced by long memory stochastic trading activity and rapid 
price fluctuations. On the other hand, if one assumed f] = 1, then long-range 
memory features of the process would be lost [20]. Earlier we investigated the 
nonlinear stochastic equations with rj > 3/2, exhibiting the long-range mem- 
ory properties [19], and proposed one as an appropriate stochastic model of 
trading activity in the flnancial markets [23]. Detailed analysis of the empiri- 
cal data from the NYSE provides evidence that long-range memory properties 
of the return strongly depend on fluctuations of trading activity. In Fig. 2 
we demonstrate strong correlation of the moving average of absolute returns 
per minute with the moving average of trading activities (number of trades 
per minute). Here for the empirical sequences of one-minute returns {^jj^j^ 
or trading activities {Nt}^^^ we calculate moving averages MA deflned as the 
centered means for a selected number of minutes n; for example, MA(rt) is 

t+n/2-l 

MA(r,) = - J2 (18) 

^ j=t-n/2 



The best correlation can be achieved when the moving averages are calculated 
in the period from 60 to 100 minutes. 

There are a lot of researchers investigating the power law distribution of re- 
turns and trading activity in the flnancial markets ^27] . The g-Gaussian PDF 
is a reasonable approximation to the empirical data [7]. The power law ex- 
ponents for the extreme values of returns and trading activity are nearly the 
same: A ~ 4 [23J . Furthermore, fascinating statistical similarity of two flnancial 
variables occurs in the power spectral density exhibiting long-range memory 
properties with two scaling exponents |23]. All these extraordinary sophisti- 
cated statistical properties are reproducible using the SDE ( IT^ introduced in 
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the previous section. 



Many non-equilibrium systems exhibit spatial or temporal fluctuations of some 
parameter. There are two time scales: the scale on which the dynamics is able 
to reach a stationary state, and the scale for which the fluctuating parameter 
evolves. A particular case is when the time needed for the system to reach 
stationarity is much smaller than the scale at which the fluctuating parameter 
changes. In the long term, the non-equilibrium system is described by the 
superposition of different local dynamics at different time intervals, which has 
been called superstatistics [281129] . 




In order to account for the double stochastic nature of return fluctuations - 
a hidden slowly diffusing long-range memory process and rapid fluctuations 
of the instantaneous price changes — we decompose the empirical one-minute 
return series into two processes: the background fluctuations and the high am- 
plitude rapid fluctuations dependent on the first one modulating. To perform 
this decomposition we assume that the empirical return can be written as 
instantaneous g-Gaussian fluctuations with a slowly diffusing parameter tq 
dependent on the moving average of the return rf. 



where ^{ro,A2} is a g-Gaussian stochastic variable with the PDF defined by 
Eq. ([3]) (the parameter q is q = I-I-2/A2). In Eq. (HM the parameter Tq depends 
on the modulating moving average of returns, MA(rt), and the empirically 
defined power law exponent A2. From the empirical time series of the one- 
minute returns one can draw histograms of C, corresponding to defined values 
of the moving average MA(rt). The g-Gaussian PDF is a good approximation 
to these histograms and the adjusted set of tq for selected values of MA(r() 
gives an empirical definition of the function 



The g-Gaussians with A2 = 5 and linear function ro(|MA(rt)|) (1201) give a 
good approximation of ^ fluctuations for all stocks and values of modulating 
MA{rt). The long-term PDF of moving average MA(r() can be approximated 
by a g-Gaussian with fo = 0.2 and A = 3.6. All these empirically defined 
parameters form the background for the stochastic model of the return in the 
financial market. 

We propose to model the long-range memory modulating stochastic return 
MA(rt) by X = fo //+^a;(t)/r dt, where a: is a continuous stochastic variable 
defined by Eq. (fTSjl . The remaining parameters e and r can be adjusted to the 
empirical data and have values e = 0.01 and r = O.OOOl/cr^ = 60 s. In Fig. 3 we 




r = e{ro(MA(n)),A2}, 



(19) 



ro(MA(rt)) = 1 + 2.5 x |MA(rt)|. 



(20) 



9 




Sif) 



10' 



Iff 



(b) 





10 100 10^'' 10'^ 10'-' 10'^ 10 



-/ 



Fig. 3. Comparison of empirical and model (|15|) - (j2U|) statistics of one-minute returns 
traded on the NYSE, (a) the empirical (black thin line) and model (gray thick line) 
PDF of normalized returns and (b) the empirical power spectrum of normalized 
returns (black thin line) averaged over 24 stocks and the model power spectrum 
(gray thick line) averaged over 24 realizations. All parameters are as follows: A2 = 5; 
fo = 0.2; T = 0.0001/^2 = 60s; A = 3.6; e = 0.01 and r] = 5/2. 



provide a comparison of the empirical PDF, averaged over 24 stocks of NYSE, 
of one-minute returns normalized to the standard deviation and the power 
spectrum with the corresponding statistics of the proposed double stochas- 
tic model. This serves as an evidence of possibility of modeling the financial 
variables using nonlinear stochastic equations with elements of nonextensive 
statistics. Noticeable difference in theoretical and empirical PDFs for small 
values of X are related with the prevailing prices of trades expressed in inte- 
ger values of cents. Obviously we do not account for this discreteness in our 
continuous description. In the empirical power spectrum one-day resonance — 
the largest spike with higher harmonics — is present. This seasonality — an 
intraday activity pattern of the signal — is not included in the model either 
and this leads to the explicable difference from observed power spectrum. 



4 Discussion and conclusions 



In the previous work |23] we provided evidence that long-range memory fluc- 
tuations of trading activity in the financial markets may be considered as the 
background stochastic process responsible for the fractal properties of other 
financial variables. This background stochastic process can be reproduced us- 
ing a nonlinear SDE (|T5l) with multiplicative noise composed of two powers of 
a stochastic variable. The two powers in the SDE reveal different behaviors of 
the market in the periods of different trading activity. In this paper we gen- 
eralized the form of the background SDE within the nonextensive statistical 
mechanics framework to reproduce fascinating statistical properties of the fi- 
nancial variables with a g-Gaussian PDF and fractured behavior of the power 
spectrum. 
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In the prevailing relatively calm periods, with x < 1/e and multiplicativity 
specified by t] = 5/2, markets behave as stationary stochastic processes with 
a g-Gaussian PDF, g ~ 1 + 2/5 = 1.4. In the periods of excited behavior, 
when X > 1/e, the PDF approaches a nonstationary regime, A ~ 2. This leads 
to the excess values of financial variables, which have to be restricted by the 
additional limits in the SDK, term (xe'')^ giving the exponential restriction 
of diffusion at the excess value x ~ l/e"^. These rare escapes of a continuous 
stochastic variable smoothed by an integration procedure do not very con- 
siderably contribute to the main PDF of a financial variable. However, these 
escapes condition the behavior of the power spectrum, reducing the exponent 
/3 of the power law distribution S{f) ~ 1//^ in the region of higher frequen- 
cies. In the case of the return, the background stochastic process defined by 
Eq. (fT5|) is hidden by the secondary high amplitude g-Gaussian stochastic 
process ^{tq, A2}. Though the background fiuctuations are considerably lower 
than the secondary ones, this drives the whole process through the empirically 
defined Eqs. (1T9|) and (|20|). 

The generalized new form of the continuous stochastic differential, equation 
(HM . enables us to reproduce the main statistical properties of the return, 
observed in the financial markets. All parameters introduced are recoverable 
from the empirical data and are responsible for the specific statistical features 
of real markets. The model does capture the distribution of the return, the 
empirical temporal dynamics and correlations evaluated through the power 
spectral density of absolute return. The model definition with two powers of 
multiplicative noise enables us to reproduce the power spectral density with 
two different scaling exponents, as observed in the empirical data. Stochas- 
tic modeling of the financial variables with the nonlinear SDK is consistent 
with the nonextensive statistical mechanics and provides new opportunities to 
capture empirical statistics in detail. 
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